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ATTENUATED BACTERIA USEFUL IN VACCINES 



The invention relates to attenuated bacteria useful 
in vaccines . 



Background to the invention 

The principle behind vaccination is to induce an immune 
response in the host thus providing protection against 
subsequent challenge with a pathogen. This may be 
achieved by inoculation with a live attenuated strain of 
the pathogen, i.e. a strain having reduced virulence such 
that it does not cause the disease caused by the virulent 
pathogen. 



Clasically, live attenuated vaccine strains of bacteria 
and viruses have been selected using one of two different 
methodologies. Mutants have been created either by 
treatment of the organism using mutagenic chemical 
compounds or by repeated passage of the organism in 
vitro. However, use of either method gives rise to 
attenuated strains in which the mode of attenuation is 
unclear. These strains are particularly difficult to 
characterise in terms of possible reversion to the wild 
type strain as attenuation may reflect single (easily 
reversible) or multiple mutation events. Furthermore, it 
is difficult to obtain such strains having optimum 
immunogenic properties because of multiple mutation 
events, and multiple strains may need to be used to 
provide protection against the pathogen. 



Using modern genetic techniques, it is now possible to 
construct genetially defined attenuated bacterial strains 
in which stable attenuating deletions can be created. A 
number of site directed mutants of Salmonella have been 
created using this type of technology (2, 4, 5, 9, 12, 
.16, 17, 18) . Mutations in a large number of genes have 



been reported to be attenuating, including the aro genes 
(e.g. aroA, aroC, aroD and aroE) , pur, AtrA, ompR, ompF, 
ompC, galE, cya , crp and phoP. 

Salmonella aroA mutants have now been well characterised 
and have been shown to be excellent live vaccines against 
salmonellosis in several animal species. In addition, in 
order to reduce the chances of a reversion to virulence 
by a recombination event, mutations have been introduced 
into two independent genes such as aroA/purA and 
aroA/aroC. Identical mutations in host adapted strains 
of Salmonella such as S.. typhi (man) and S.dublin (cattle) 
has also resulted in the creation of a number of 
candidate single dose vaccines which have proved success- 
ful in clinial (8, 11) and field trials (10) . 

A Salmonella typhimurium strain harboring stable 
mutations in both ompC and ompF is described in Chatfield 
et al (1991, ref. 21) . When administered orally to BALB/c 
mice the strain was attenuated, with the 50% lethal dose 
(LD50) reduced by approximately 1, 000-fold. However, the 
intravenous LD50 was reduced only by approximately 10- 
fold, demonstrating the importance of the porins in 
confering on the bacteria the ability to infect by the 
oral route . 

Expression of the ompC and ompF genes is regulated by 
ompR. Pickard et al (1994, ref. 13) describes the cloning 
of the ompB operon, comprising the ompR and envZ genes, 
from a Salmonella typhi Ty2 cosmid bank and 
characterisation by DNA sequence analysis. The DNA 
sequence data were used to identify appropriate 
restriction sites for generating a defined deletion of 
517 bp within the open reading frame of the ompR gene. 



This deletion was introduced by homologous recombination 
into the chromosomes of two S. typhi strains which already 
harbored defined deletions in both the aroC and aroD 
genes. The S. typhi ompR mutants displayed a marked 
decrease in ompC and ompF porin expression as 
demonstrated by examination of outer membrane 
preparations. It was also shown that the ompR-envZ two 
component regulatory system plays an important role in 
the regulation of Vi polysaccharide synthesis in S. typhi. 

In animal studies, attenuated S . typhimurium has been used 
as a vehicle for the delivery of heterologous antigens to 
the immune system (3, 6, 15) . This raises the potential 
of the development of multivalent vaccines for use in man 
(7) . 



Summary of the Invention 

The invention provides a bacterium attenuated by a non- 
reverting mutation in each of the aroC gene, the ompF 
gene and the ompC gene. The invention also provides a 
vaccine containing the bacterium. 



It is believed that the aroC/ompF/ompC combination of 
mutations gives a vaccine having superior properties. For 
example, it is believed that the aroC/ompF/ompC 
combination may be superior to a aroC/ompR combination 
for two reasons : 



1. The ompR mutation may cause higher levels of 
attenuation than the ompF/ompC combination of 
mutations because ompR may regulate a number of 
genes other than ompF and ompC which are important 
for survival of the bacterium in vivo. Thus, the 
ompF/ompC combination may allow the bacterium to 
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survive in the vaccinated host for a longer time 
and at higher levels, resulting in better 
protection . 

5 2 . The ompR mutation may cause reduced immunogenici t y 
compared to the ompF/ompC combination of mutations 
because ompR may regulate the expression of 
antigens important for immunogenici ty . 

10 Detailed Description of the Invention 

Bacteria useful in the Invention 

The bacteria that are used to make the vaccines of the 
invention are generally those that infect by the oral 
15 route. The bacteria may be those that invade and grow 

within eukaryotic cells and/or colonise mucosal surfaces. 
The bacteria are generally Gram-negative. 

The bacteria may be from the genera Escherichia, 
20 Salmonella, Vibrio, Haemophilus, Neisseria, Yersinia, 
Bordetella or Brucella. Examples of such bacteria are 
Escherichia coli - a cause of diarrhoea in humans; 
Salmonella typhimurium - the cause of salmonellosis in 
several animal species; Salmonella typhi - the cause of 
25 human typhoid; Salmonella enteritidis - a cause of food 

poisoning in humans; Salmonella choleraesuis - a cause of 
salmonellosis in pigs; Salmonella dublin - a cause of 
both a systemic and diarrhoel disease in cattle, 
especially of new-born calves; Haemophilus influenza - a 
30 cause of meningitis; Neisseria gonorrhoeae - a cause of 
gonorrhoeae; Yersinia enterocolitica - the cause of a 
spectrum of diseases in humans ranging from 
gastroenteritis to fatal septicemic disease; Bordetella 
pertussis - the cause of whooping cough; and Brucella 



abortus - a cause of abortion and infertility in cattle 
and a condition known as undulant fever in humans. 

Strains of E.coli and Salmonella are particularly useful 
in the invention. As well as being vaccines in their own 
right against infection by Salmonella, attenuated 
Salmonella can be used as carriers of heterologous 
antigens from other organisms to the immune system via 
the oral route. Salmonella are potent immunogens and are 
able to stimulate systemic and local cellular and 
antibody responses. Systems for driving expression of 
heterologous antigens in Salmonella in vivo are known; 
for example the nirB and htrA promoters are known to be 
effective drivers of antigen expression in vivo. 

The invention may be appied to enterotoxigenic E.coli 
( M ETEC" ) . ETEC is a class of E.coli that cause 
diarrhoea. They colonise the proximal small intestine. 
A standard ETEC strain is ATCC H10407. 

Infections of ETEC are the single most frequent cause of 
travellers diarrhoea, causing 3-9 million cases per year 
amongst visitors to developing countries. In endemic 
areas, ETEC infections are an important cause of 
dehydrating diarrhoea in infants and young children, 
resulting in 800,000 deaths a year in the under fives 
world-wide. In developing countries, the incidence of 
ETEC infections leading to clinical disease decreases 
with age, indicating that immunity to ETEC infection can 
be acquired. In contrast, naive adults from 
industrialized countries who visit endemic areas are 
highly susceptible to ETEC infections. However, with 
prolonged or repeated visits to endemic areas 
susceptibility to ETEC infections diminishes, suggesting 
that a live attenuated approach to ETEC vaccination may 
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prove successful. 

The inventors chose to work on a non- toxigenic strain of 
ETEC called E1392/75/2A. E1392/75/2A arose spontaneously 
5 from a toxic mutant by deletion of toxin genes. In human 
studies, oral vaccination with live E1392/75/2A gave 75% 
protection against challenge with toxin-expressing ETEC 
from a different serotype. However, approximately 15% of 
vaccinees experienced diarrhoea as a side effect of the 
10 vaccine. The strain needs further attenuation to reduce 
the side effects before it can be considered as a 
potential vaccine and the invention gives a means of 
achieving such attenuation. 

15 Seq Id No. 1 shows the sequence of the E.coli aroC gene, 
Seq Id No. 3 shows the sequence of the E.coli ompC gene 
and Seq. Id No. 5 shows the sequence of the E.coli ompF 
gene . 

2 0 Further mutations 

One or more further mutations may be introduced into the 
bacteria of the invention to generate strains containing 
mutations in addition to those in aroC, ompC and ompF. 

25 Such a further mutation may be (i) an attenuating 

mutation in a gene other than aroC, ompC and ompF, (ii) a 
mutation to provide in vivo selection for cells 
maintaining a plasmid (e.g. a plasmid expressing a 
heterologous antigen), or (iii) a mutation to prevent 

30 expression of a toxin gene. 



The further attenuating mutation may be a mutation that 
is already known to be attenuating. Such mutations 
include mutations in aro genes (e.g. aroA, aroD and 



aroE) , pur, htrA, ompR, galE, cya, crp, phoP and surA 
(see e.g. refs 2, 4, 5, 9, 12, 13, 16, 17 and 18). 

A mutation to provide selection for maintenance of a 
plasmid may be made by mutating a gene that is essential 
for the bacterium to survive. A plasmid carrying the 
essential gene is then introduced into the bacterium, so 
that only cells carrying the plasmid can survive. This 
may be useful where the plasmid contains, for example, a 
heterologous antigen to be expressed by the bacterium. 

A mutation to prevent expression of a toxin gene may be 
made to reduce any side-effects caused by vaccination 
with the bacterium. For example, in the case of 
vaccination with E.coli strains such as ETEC it may be 
desirable to mutate the heat labile toxin (LT) or heat 
stable toxin (ST) genes so that they are not expressed. 

The nature of the mutations 

The mutations introduced into the bacterial vaccine 
generally knock-out the function of the gene completely. 
This may be achieved either by abolishing synthesis of 
any polypeptide at all from the gene or by making a 
mutation that results in synthesis of non-functional 
polypeptide. In order to abolish synthesis of 
polypeptide, either the entire gene or its 5 1 -end may be 
deleted. A deletion or insertion within the coding 
sequence of a gene may be used to create a gene that 
synthesises only non-functional polypeptide (e.g. 
polypeptide that contains only the N-terminal sequence of 
the wild-type protein) . 

The mutations are non-reverting mutations. These are 
mutations that show essentially no reversion back to the 
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wild-type when the bacterium is used as a vaccine. Such 
mutations include insertions and deletions. Insertions 
and deletions are preferably large, typically at least 10 
nucleotides in length, for example from 10 to 600 
5 nucleotides. Preferably, the whole coding sequence is 
deleted . 

The bacterium used in the vaccine preferably contains 
only defined mutations, i.e. mutations which are 
10 characterised. It is clearly undesirable to use a 

bacterium which has uncharacter ised mutations in its 
genome as a vaccine because there would be a risk that 
the uncharacterised mutations may confer properties on 
the bacterium that cause undesirable side-effects. 

15 

The attenuating mutations may be introduced by methods 
well known to those skilled in the art (see ref. 14). 
Appropriate methods include cloning the DNA sequence of 
the wild-type gene into a vector, e.g. a plasmid, and 

20 inserting a selectable marker into the cloned DNA 
sequence or deleting a part of the DNA sequence, 
resulting in its inactivation. A deletion may be 
introduced by, for example, cutting the DNA sequence 
using restriction enzymes that cut at two points in or 

25 just outside the coding sequence and ligating together 
the two ends in the remaining sequence. A plasmid 
carrying the inactivated DNA sequence can be transformed 
into the bacterium by known techniques such as 
electroporation and conjugation. It is then possible by 

30 suitable selection to identify a mutant wherein the 
inactivated DNA sequence has recombined into the 
chromosome of the bacterium and the wild-type DNA 
sequence has been rendered non-functional by homologous 
recombination . 



35 
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Expression of heterologous antigens 

The attenuated bacterium of the invention may be 
genetically engineered to express an antigen that is not 
expressed by the native bacterium (a "heterologous 
5 antigen"), so that the attenuated bacterium acts as a 

carrier of the heterologous antigen. The antigen may be 
from another organism, so that the vaccine provides 
protection against the other organism. A multivalent 
vaccine may be produced which not only provides immunity 

10 against the virulent parent of the attenuated bacterium 
but also provides immunity against the other organism. 
Furthermore, the attenuated bacterium may be engineered 
to express more than one heterologous antigen, in which 
case the heterologous antigens may be from the same or 

15 different organisms. 

The heterologous antigen may be a complete protein or a 
part of a protein containing an epitope. The antigen may 
be from another bacterium, a virus, a yeast or a fungus. 

20 More especially, the antigenic sequence may be from 

E.coli (e.g. ETEC) , tetanus , hepatitis A, B or C virus, 
human rhinovirus such as type 2 or type 14, herpes 
simplex virus, poliovirus type 2 or 3, foot-and-mouth 
disease virus, influenza virus, coxsackie virus or 

25 Chlamydia trachomatis . Useful antigens include non-toxic 
components of E.coli heat labile toxin, E.coli K88 
antigens, ETEC colonization factor antigens, P. 69 protein 
from B .pertussis and tetanus toxin fragment C. 

30 The ETEC colonization factors and components thereof are 
prime candidates for expression as heterologous antigens. 
To instigate diarrhoeal disease, pathogenic strains of 
ETEC must be able to colonize the intestine and elaborate 
enterotoxins . For most strains of ETEC colonization 

35 factors (CF) that are required for adhesion to the 
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intestinal mucosa have been identified. In almost all 
cases CFs are expressed as fimbrae on the outer surface 
of the bacteria. A large number of CFs have been 
identified, the most prevalent being CFAI, CRAII 
5 (includes CS1, CS2, CS3) and CFAIV (includes CS4 , CSS, 

CS6) . 

A vaccine to ETEC will ideally give protection against a 
range of colonization factor antigens to ensure that 
protection against different strains is obtained. In 
order to achieve this, it would be possible to express 
several colonization factors in one strain. 
Alternatively, the same attenuations could be made in a 
range of different ETEC strains, each with a different 
colonization factor. This would involve deleting the 
toxins from such strains. 

The DNA encoding the heterologous antigen is expressed 
from a promoter that is active in vivo. Two promoters 
20 that have been shown to work well in Salmonella are the 
nirB promoter (19, 20) and the htrA promoter (20) . For 
expression of the ETEC colonization factor antigens, the 
wild-type promoters could be used. 

25 A DNA construct comprising the promoter operably linked 
to DNA encoding the heterologous antigen may be made and 
transformed into the attenuated bacterium using 
conventional techniques. Transf ormants containing the DNA 
construct may be selected, for example by screening for a 

30 selectable marker on the construct. Bacteria containing 
the construct may be grown in vitro before being 
formulated for administration to the host for vaccination 
purposes . 



10 
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Formulation of the vaccine 

The vaccine may be formulated using known techniques for 
formulating attenuated bacterial vaccines. The vaccine is 
advantageously presented for oral administration, for 
example in a lyophilised encapsulated form. Such capsules 
may be provided with an enteric coating comprising, for 
example, Eudragate ,f S" (Trade Mark), Eudragate M L" (Trade 
Mark) , cellulose acetate, cellulose phthalate or 
hydroxypropylmethyl cellulose. These capsules may be used 
as such, or alternatively, the lyophilised material may 
be reconstituted prior to administration,, e.g. as a 
suspension. Reconstitution is advantageously effected in 
a buffer at a suitable pH to ensure the viability of the 
bacteria. In order to protect the attenuated bacteria and 
the vaccine from gastric acidity, a sodium bicarbonate 
preparation is advantageously administered before each 
administration of the vaccine. Alternatively, the vaccine 
may be prepared for parenteral administration, intranasal 
administration or intramuscular administration. 

The vaccine may be used in the vaccination of a mammalian 
host, particularly a human host but also an animal host. 
An infection caused by a microorganism, especially a 
pathogen, may therefore be prevented by administering an 
effective dose of a vaccine prepared according to the 
invention. The dosage employed will ultimately be at the 
discretion of the physician, but will be dependent on 
various factors including the size and weight of the host 
and the type of vaccine formulated. However, a dosage 
comprising the oral administration of from 10 7 to 10 11 
bacteria per dose may be convenient for a 70 kg adult 
human host. 



Experimental section 

The experiments described in this section serve to 



illustrate the invention. 

Brief description of the drawings 



Figure 1 shows a system for constructing defined 
deletions in target genes using splicing by overlay 
extension PCR mutagenesis. 

Figure 2 shows the expected sequences of target genes 
after recombination and selection for deletions. 

Figure 3 shows the cloning of deletion cassettes into 
plasmid pCVD442 . 

Figure 4 shows an SDS-PAGE analysis of outer membranes 
prepared from ETEC strains under conditions of low (no 
salt L-broth) and high (no salt L-broth + 15% sucrose) 
osmolarity. M = markers; Sample 1 = PTL010; Sample 2 - 
PTL002; Sample 3 = PTL003; Sample 4 = AaroCAompC; Sample 
5 = AompF . 

Figure 5 shows expression of CS1 and CS3 in deletion 
strains after growth on CFA agar. Equal numbers of cells 
from each strain were loaded on a 15% SDS-PAGE gel and 
Western blotted with monospecific anti-CSl or anti-CS3 
polyclonal antibodies. Controls for antibody specificity 
were whole cesll lysates of TGI cells expressing the 
majore pilin protein of CSl, or purified major pilin 
protein from CS3. Lane M, rainbow low molecular mass 
markers; lane 1, induced TGI cells harbouring pKK223; 
lane 2, induced TGI cells harbouring pKKCsl; lane 3, CSl- 
ETEC strain; lane 4, PTL010; lane 5, PTL001; lane 6, 
PTL002; lane 7, PTL003; lane 8, purified CS3 major pilin 
protein. + 



Figure 6 shows a Southern blot of mutant loci. 
Chromosomal DNA was extracted from the wild-type ETEC 

(E1392/75-2A) , PTL001 (htrA aroC), PTL002 (aroC ompR) and 
PTL003 (aroC ompC ompF) as indicated, digested with 
restriction endonuclease .EcoRV, and pulsed field 
electrophoresed through 1% agarose. DNA was blotted from 
the gel onto Hybond N+ nylon membranes (Amersham) and 
hybridised with DNA probes derived from the aroC, htrA, 
ompR, ompC, or ompF loci as shown. The banding patterns 
are consistent with the mutant loci being deletions. 

Design of deletions and construction of plasmids 
pCVD AAroC. oCVD AOzwdC and -pCVDAOmpF 

Deletions were designated to remove the . entire open 
reading frame of the target gene. Using the E.coli genome 
sequence as a template, PCR primers were designed to 
amplify fragments of 500-600 base- pairs flanking the 
target open reading frame (see Table 1 for primer 
sequences) . Splicing by overlap extension using PCR was 
used to fuse the two flanking sequences, creating a PCR 
product with the entire gene deleted (Figure - 1) . The 
wild-type sequences around the deletion site and the 
predicted sequences after deletion are depicted in Figure 
2 . 

For each gene two different restriction sites were 
introduced into the splice region (see Table 2 below) . 
These were used for identification of deletion clones. 
The PCR primers at either end of the PCR fragment 
introduced unique restriction sites that were used to 
clone the fragment into the multiple cloning site of 
pCVD4 4 2 (Figure - 3) . 

PCR products were gel purified using a Qiagen (Trade 
Name) gel extraction kit and digested with the relevant 



restriction enzymes prior to ligation to the suicide 
plasmid pCVD442(22) digested with the same enzyme and 
treated with alkaline phosphatase to prevent vector self- 
ligation (Figure - 3). The ligation mix was transformed 
into SY327Xpir and plated on L-Ampicillin (100 pg/ml) 
plates. Plasmids from Ampicillin resistant t ransf ormants 
were screened for the presence of the deletion cassettes 
by restriction digestion. The following plasmids were 
generated : 

pCVDAAroC 
pCVDAOmpC 
pCVDAOmpF 

The suicide plasmid pCVD442 can only replicate in cells 
harboring the pir gene. On introduction into non-pir 
strains, pCVD442 is unable to replicate, and the 
Ampicillin resistance conferred by the plasmid can only 
be maintained if the plasmid is integrated in the 
chromosome by a single homologous recombination event. 
The plasmid also has a sacB gene, encoding levan sucrase, 
which is toxic to gram negative bacteria in the presence 
of sucrose. This can be used to select clones that have 
undergone a second recombination event, in which the 
suicide plasmid is excised. Such cells will be resistant 
to sucrose, but Ampicillin sensitive. 

Construction and characterisation of AAjroCAOmpCAOnupF 
strain 

This section outlines the chronology of construction and 
history of a AAroCAOmpCAOmpF strain. In the section, 
"ETEC" refers specifically to strain E1392/75/2A or its 
derivatives . 



AAroCAOmpCAOmpF deletions were introduced into 
E1392/75/2A in the following order: 
AAroC-AAroCAOmpC-AAroCAOmpCAOmpF 

Construction of ETECAAr-oC 

1) E1392/75/2A from original microbanked stock was 
plated onto L-Agar. 

2) Electroporation competent cells were prepared from 
these cells. 100 pi aliquots were frozen. 

3) pCVDAAroC was purified from SY327pir cells using a 
Qiagen Qiafilter (Trade Name) midiprep. The plasmid 
was concentrated about 10-fold by ethanol 
precipitation. The construction of pCVDAAroC is 
described above. 

4) 5 pi of concentrated plasmid was mixed with 100 pi 
defrosted cells and electroporated . The whole 
transformation was plated on an L-Ampicillin plate 
(50 pg/ml) and incubated overnight at 37°C. 

5) A single Ampicillin resistant colony grew. 

6) The colony was streaked onto an L-Ampicillin plate 
(100 pg/ml) and grown overnight at 37°C 

( "merodiploid plate") . 

7) PCR using primers TT19 and TT20 (specific for the 
aroC gene) and a colony picked from the merodiploid 
plate amplified two bands, with sizes corresponding 
to that of the wild-type and AaroC genes. The 
seguences of the primers are shown in Table 1 
below. 

8) A colony from the merodiploid plate was grown up 
for 7 hr in a) L-Ampicillin broth (100 pg/ml) and 
b) L-Broth. The colony grown on L-Ampicillin was 
microbanked . 

9) Serial dilutions of the L-broth culture were set up 
on : 

a) No salt L-agar 
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10 



15 



10) 



11) 



12) 
13) 



b) No salt L-agar + 5% sucrose. 
The plates were incubated overnight at 30°C. 
Colony counts showed that 10 4 more colonies grew on 
L-agar than on L-agar + 5% sucrose, showing sucrose 
selection worked. 

Sucrose resistant colonies were screened for the 
presence of AaroC gene by PCR. Colonies chosen for 
screening were picked onto an L-agar plate and 
grown overnight at 37°C. This plate was stored at 
4°C, whilst further tests were carried out. 
50% of 90 colonies tested had AaroC only. 
Colonies were tested for growth on: 

a) M-9 minimal media plates 

b) M-9 minimal media + Aromix plates 

c) L-Amp (100 pg/ml) 

AaroC colonies should not grow on M-9 minimal media 
without Aromix or on L-Amp. 

Aromix is a mix of aromatic compounds as follows: 



20 
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Substance 


Final concentration 
(% w/v) 


Phenylalanine 


0.004 


Tryptophan 


0.004 


Tyrosine 


0.004 


p-aminobenzoic acid 


0.001 


dihydroxybenzoic acid 


0.001 



These compounds are made in wild-type bacteria, but 
the aroC mutation prevents their synthesis. 

14) 13/14 putative AAroC colonies required Aromix for 
30 growth on M-9 minimal media and were susceptible to 

Ampicillin . 

15) 3 colonies (No. 1,2,3) were tested for the presence 
of the CS1 major pilin protein gene by PCR using 



primers MGR169 and MGR170. All 3 colonies gave PCR 
products of the expected size (700 bp.). The 
sequences of the primers are shown in Table 1. 

16) Colonies 1, 2 and 3 from screening master plate 
were streaked onto L-Agar and grown overnight at 
37°C. Cells from these plates were used to 
inoculate microbank tubes. 

17) Colony 1, stored in a microbank, was used for 
further work. 

18) For permanent storage, a bead from the microbank 
tray was inoculated into 1 ml L-broth, grown for 4 
hr with shaking at 37°C and used to make agar 
slopes which were used to make freeze dried stocks. 
The freeze dried stock of E1392/75/2AZlAroC was 
designated PTL004 . 20 ml of L-broth was added to 
the rest of the 1 ml culture and the culture was 
incubated overnight at 30°C. 1 ml of the overnight 
culture was transferred to each of three cryovials 
and stored in liquid nitrogen. 

Construction of ETEC AAroCAOmpr 

1) Preparation of pCVDAOmpC plasmid DNA for 

electroporation : 

A colony of SY327Xpir harbouring pCVDAOmpC was 
grown overnight at 37°C in 100 ml L-Ampicillin 
broth 

(100 pg/ml) . Plasmid DNA was purified using 2 
Qiagen Qiafilter (Trade Name) midipreps . DNA was 
further concentrated by ethanol precipitation. The 
construction of pCVDAOmpC is described above. 
2) Preparation of electrocompetent cells: 

ETECAAroC cells from the microbank tray produced in 
step 17 of the preceding section were streaked on 
L-agar, grown at 37°C overnight and then stored at 
4°C for no more than 1 week before being used to 



inoculate cultures for preparing elect rocompetent 
cells . 

ETECAAroC cells were electroporated with 5 pi of 
concentrated pCVDAOmpC DNA, and each transformation 
plated on a single L-Ampicillin plate (50 pg/ml) 
and grown overnight at 37°C. 

17 Ampicillin resistant colonies (putative 
ETECZlAroC/ pCVDAOmpC merodiploids ) were obtained. 
These colonies were spotted onto a master L- 
Ampicillin (100 yig/ml) plate and used as templates 
for PCR with primers TT7/TT8. The master plate was 
grown at room temperature over the weekend. The 
sequences of the primers are given in Table 1 
below . 

A single colony (No. 7) had the AompC gene. 

The colony was grown for 5 hr in L-broth. 

Serial dilutions of the L-broth culture were set up 

on : 

a) No salt L-agar 

b) No salt L-agar + 5% sucrose. 
The plates were incubated overnight at 30°C. 
Colony counts showed that 10 4 more colonies grew on 
L-agar than on L-agar + 5% sucrose, showing sucrose 
selection worked. 

45 sucrose resistant colonies were screened for 
AompC by PCR using primers TT7 and TT8 . 9 colonies 
had the AompC gene, but most had traces of w.t. 
ompC gene. The sequences of the primers are given 
in Table 1 below. 

To further characterise putative ETECAAroCAOmpC 
colonies, they were grown in 1 ml L-Broth for 5 hr 
and plated on : 

a) L-Agar * 100 pg/ml Ampicillin 

b) L-Agar ^ 

c ) L-Agar + 5% sucrose 



AOmpC colonies should be resistant to sucrose and 
sensitive to Ampicillin. 

12) Only 1 colony (No. 1) was Ampicillin sensitive and 
sucrose resistant . 

13) Colony 1 was checked for the presence of AaroC, 
AompC and CS1 genes by PCR with primers TT19/TT20, 
TT7/TT8 and MGR169 and 170. The sequences of the 
primers are given in Table 1 below. 

14) Colony 1 gave single PCR products of the expected 
size for AaroC, AompC and CS1 genes. 

15) The colony was microbanked. 

16) For permanent storage, a bead from the microbank 
was inoculated into- 1 ml L-broth, grown for 4 hr 
with shaking at 37°C and used to make agar slopes 
which were freeze dried. The freeze dried stock of 
E1392 /7 5 /IKAAroCAOmpC was designated PTL008 . 20 ml 
of L-broth was added to the rest of the 1 ml 
culture and the culture was incubated overnight at 
30°C. 1 ml of the overnight culture was transferred 
to each of three cryovials and stored in liquid 
nitrogen. 



Construction of ETEC&AroCAOmpCAOmpIT 

Conjugation was used to introduce pCVDAOmpF into 
E1392/75/ 2AAAroCAOmpC . 

1) Conjugation donor cells SMIOXpir were transformed 
with pCVDAOmpF. The construction of plasmid 
pCVDAOmpF is described above. 

2) ETEC A AroCAOmpC cells were conjugated with SMIOXpir/ 
pCVDAOmpF cells. The pCVD442 plasmid includes a 
transfer origin which allows the plasmid to be 
transferred from a donor strain containing the RP4 
transfer genes (e.g. SMIOXpir) to a recipient 
strain (e.g. ETEC) . ETECAa toCa ompC cells and 
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E.coli strain SMlOApir harbouring the PcvdAompF 
recombinant were cross-streaked on L-agar plates so 
as to cover an area of approximately 10 cm 2 . 
Plates were incubated at 37° C for 20 h, then the 
5 growth washed off using 4 ml L-broth and the 

suspension plated onto McConkey agar (Difco) 
containing streptomycin at 20^g ml" 1 and ampicillin 
at 300/^g ml" 1 . Plates were incubated overnight at 
37°C and resulting colonies were checked for 
10 merodiploidy by PGR using appropriate 

oligonucleotides as primers. 

3) Putative ETEC t ranscon j ugant s were screened. 10 
colonies were picked from McConkey agar plates and 
grown overnight on L-Ampicillin (100 pg/ml) agar. 

15 The presence of AompF gene was checked for by PCR 

with primers TT1/TT2. The sequences of the primers 
are given in Table 1 below. 

4) The colonies were grown for 5 hr in L-broth. 

5) Serial dilutions of the L-broth culture were set up 
20 on: 

a) No salt L-agar 

b) No salt L-agar + 5% sucrose. 
The plates were incubated overnight at 30°C. 

6) Colony counts showed 10 5 more colonies grew on L- 
25 agar than on L-agar + 5% sucrose, showing sucrose 

selection worked. 

7) Sucrose resistant colonies were screened for AompF 
gene by PCR with primers TT1/TT2. The sequences of 
the primers are given in Table 1 below. The 

30 screened colonies were grown overnight on L-Agar. 3 

colonies out of 47 had the AompF gene with no 
evidence of the wild-type ompF gene. 

8) To further characterise putative 
ETECAAroCAOmpCAOmpF colonies, they were plated on: 

35 a) L-Agar + 100 pg/ml Ampicillin 



b) L-Agar 

c) L-Agar + 5% sucrose 

AompF colonies should be resistant to sucrose and 
sensitive to Ampicillin . 

9) All three AompF colonies were Ampicillin sensitive 
and sucrose resistant. 

10) The colonies were microbanked and one colony was 
chosen as a master stock. 

11) For permanent storage, a bead from the master stock 
was inoculated into 1 ml L-broth, grown for 4 hr 
with shaking at 37°C and used to make agar slopes 
which were used to make freeze dried stocks. The 
freeze dried stock of E1392/75/ 2AAaroCAompCAompF 
was designated PTL003. 20 ml of L-broth was added 
to the rest of the 1 ml culture and the culture was 
incubated overnight at 30°C. 1 ml of the overnight 
culture was transferred to each of three cryovials 
and stored in liquid nitrogen. 

Characterisation of E1392 /7 5 / 2&AAroCAOmpCAOmpF 

1) Growth requirements: 

Cells taken from the master stock produced in step 
10 of the preceding section were streaked on L-Agar 
plate. At the same time 8 ml L-broth was inoculated 
for a chromosomal DNA prep for Southern blots. Both 
plate and liquid culture were grown overnight at 
37°C. 

Cells from the grown plate were streaked onto the 
following media and grown overnight at 37°C. 

Medium Gro 

wth 

L-Amp 

No 

M9 minimal media 

No 



Yes 



No 



Yes 



Yes 



10 Yes 
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M9 minimal + Aromix 

M9 + sulf athiazole (100 pg/ml) 

M9 + sulf athiazole (100 pg/ml) + Aromix 

L-Agar + 50 pg/ml streptomycin 

L-Agar + 5% sucrose 



As expected, the cells were Amp sensitive. The 
cells were resistant to sucrose, streptomycin and 
sulf athiazole, but required Aromix to grow on 
minimal media . 
15 2) LPS analysis of PTL003: 

a) A freeze dried vial of PTL003 was broken 
open. The culture was resuspended in L-Broth 
and plated on 

L-Agar for growth. Some cells were scraped 
20 off and stored in microbank. 

b) More cells were scraped off and the LPS 
profile was analysed. There was no visible 
difference between the LPS profile of PTL003 
and original E1392/75/2A strain. 

25 3) Confirmation of deletions by PCR: 

a) A scrape of cells was taken from the plate 
made in in 2a and streaked onto L-Agar and 
grown overnight. 

b) Freshly grown cells were used for PCR with 

30 primers that flank the following genes: aroC, 

htrA, ompC, ompF, ompR. 

c) PTL003 was shown to have deletions in aroC, 
ompC and ompF .genes, but not in htrA or ompR. 

4) Outer membrane protein preps of PTL003: 

35 a) An outer membrane prep of PTL003 was made 



using cells from step 3a above, 
b) The outer membrane fraction was frozen and 
gels run (Figure 4). 

5) Checking expression of CS1 and CS3 in PTL003: 

a) The plate from 3a above was used. A colony of 
PTL003 and PTL010 (E1392/75/2A freeze dried 
stock) were grown up for 4 hr in L-Broth. 2 
pi was dotted on each of four CFA-Agar plates 
and grown overnight at 37°C or 18°C. 

b) The 37°C and 18°C plates were blotted with 
anti-CSl and anti-CS3 antibodies. 

c) The results are shown in Figure 5. No CS1 or 
CS3 expression was seen at 18°C with either 
PTL010 or PTL003. Both PTL010 and PTL003 
expressed CS1 and CS3 at 37°C. If anything 
PTL003 expressed slightly more CS1, but this 
may reflect different cell numbers or sticki- 
ness to nitrocellulose rather than 
differences in pili/cell. 

6) Southern blotting of PTL003: 

Structure of deletion mutations . Total DNA was 
extracted from cultures of the three deletion mutants 
grown from the microbanked stocks, digested with 
restriction endonuclease EcoRV, and the digested DNA 
subjected to pulsed field agarose gel electrophoresis. 
DNA was blotted from the gels onto Hybond N+ (Trade Name) 
nylon membranes and hybridised with appropriate DNA 
probes according to standard procedures. Results (Figure 
6) show that the hybridising chromosomal DNA fragments of 
the mutants are shorter than the wild-type, consistent 
with the mutations being deletions. Confirmation of 
absence of Heat-Stable (ST) and Heat-Labile (LT) toxin 
genes in E.coli strain E1392/75-2A. For this the ST and 
LT-AB genes were used as DNA probes against total DNA 
from E1392/75-2A. Total D'NA from the toxin positive ETEC 
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strain E1393/75 was included as a positive control, while 
that from the laboratory E.coli strain JM109 was included 
as a negative. Hybridised membranes were left under 
Hyperf ilm-ECL (Trade Name) for 1 h to obtain the maximum 
5 amount of signal. Probes were prepared using PCR with 
plasmid DNA extracted from E1392/75-2A as template and 
oligonucleotides EST01 and EST02 as primers for ST, or 
LT-R1 and LT-03 for LT-AB. There was no significant 
hybridisation with total DNA using either the LT-AB or 
10 the ST probe, despite obtaining a very intense signal 
from the positive control total DNA. 

Confirmation of absence of pCVD442 sequences from the 
chromosome of deletion mutants . The plasmid pCVD442 was 
labelled and hybridised to total DNA from deletion 

15 mutants PTL001, PTL002 and PTL003 digested with EcoRV. 

Total DNA from ETEC strain E1392/75-2A was included as a 
control. A complex pattern of hybridising DNA fragments 
was obtained. But, there was no significant difference 
between the pattern obtained for the wild-type and that 

20 for the mutants, indicating that probably no residual 

pCVD442 nucleotide sequences were left in the genomes of 
the mutants. The complex pattern of hybridising 
fragments was most likely due to the pCVD442 probe 
hybridising with the plasmid DNA components of the 

25 E1392/75-2A strain and mutant derivatives. 

Analysis of outer membrane protein profile of PTL003: 
Outer membrane protein fractions were prepared from 
strains PTL010 (E1392/75/2A) and the deletion strains 

30 PTL002 and PTL003 . A strain with a single ompF deletion 
and a strain with both aroC and ompC deletion were also 
analysed. Strains were grown under conditions of low 
osmolarity (no salt L-broth) and high osmolarity (no salt 
L-broth+15% sucrose) . The OmpF protein product is 

35 normally expressed at low osmolarity whereas the OmpC 



-25- 



product is expressed at high osmolarity. The OmpC and 
OmpF proteins have similar elec troporetic mobilities. At 
both high and low osmolarities, the strain PTL003 lacks 
proteins in the OmpC/OmpF region when compared to the 
wild-type E1392/75/2A strain or to the AAroCAOmpC or 
a OmpF deletion strains. The results are shown in Figure 
4 . 

Expression of CS1 and CS3 pili on CFA agar: 
The expression of CS1 and CS3 pili in the deletion 
strains was examined. Equal numbers (2 A 600nm units) of 
bacteria strains PTL010, PTL001, PTL002 and PTL003 grown 
overnight at 37°C on CFA agar were subjected to SDS PAGE 
and analysed by Western blotting with monospecific 
polyclonal antibodies against CS1 or CS3. CS1 and CS3 
pili were expressed equally well in four strains (Figure 
5) . 

A CFAII-negative derivative of E1392/75/2A was 
constructed for use as a control. This was done by 
specific curing of the CS encoding plasmids from ETEC 
strain E1392/75-2A. A short fragment of DNA was amplified 
from the cooB gene using PCR with oligonucleotides CSA01 
and CSA02 as primers and ligated into pGEM-T Easy plasmid 
vector (Trade Name, Promega) designed for the cloning of 
PCR products. The fragment was subcloned into pCVD442 by 
virtue of the Sail and SphI restriction enzyme sites. The 
pCVD442-cooB derivative was introduced into ETEC strain 
E1392/75/2A by conjugation from SMIOApir. Ampicillin 
resistant transcon j ugants are most likely to be the 
result of fusion of the pCVD4 42-cooB derivative with 
cooB-bearing plasmid. Such transcon j ugates were then 
grown on L-agar supplemented with 5% sucrose to select 
for loss of the sacB gene of pCVD442. Resulting colonies 
were tested for ampicillin sensitivity, and by PCR using 
CSA01 and CSA02 as primers'. Three colonies of E1392/75/2A 



-26- 

were included as positive controls among these PCRs . Two 
sucrose resistant colonies that gave no product with the 
PCR were streaked out onto fresh L-agar supplemented with 
5% sucrose to obtain pure cultures. These were then grown 
in L-broth at 37°C for approximately 16 h and microbanked 
at -70°C. Loss of the CS1 encoding plasmid was confirmed 
by analysis of the plasmid profiles of the derivatives 
using agarose gel electrophoresis. Two derivatives were 
confirmed as CS1 negative, but were still CS3+. 



-27- 



Table 1 - PCR primers 



Name 


Target 


Use 


Sequence ( 5 1 -3 ' ) 


TT 1 


.-—i m 7—1 IT' 
{JlilfcJ £ 


rri-Xiiitri. t\ nj x CLonxiiy 


ATP TPT TTT* TTT 7\ C C 
nlL, 1 bl lib lib AbL 

TCA GCA ATC TAT TTG 
CAA CC 


TT 9 


s-\ m 7™i J* 1 


Jr 1. XHlt; JL O 1 UI cxoiixiiy 


TTT TTT CCC Ts.CC 7\ T 1 r~* 
111 111 bbb HbU /-lib 

CCG GCA GCC ACG CGT 
AGT G 


TT ? 


omjplT 


IT 1. Xlllfci -L XOX CXOIlXIiy 


pTp r* 7\ r: rpT TZ\f^ CTC 
uH- bno bb 1 1 no i^l t 

TAT TTA TTA CCC TCA 
TGG 


TT /I 
1 1 <± 


oitipF 1 


rrrxiuer u ior cionmy 


CUC PTZ1 ~LCC CTC CAC 
bHb L1H Abb bib bnb 

TAA TAG CAC ACC TCT 
TTG 


TT7 


ompC 


xrrxmer a xor cioning 


fpipp CTC rZV2\ 1\CT CC1\ 
lib bib \Jt\t\ nbl Lb/i 

CGG ATG TTA ATT ATT 
TGT G 


TT fi 
lit) 


ontpC 


Primer 1 B for cloning 


CCC C 7\ 7\ 1\ C C CCH CCT 
bbb bi\.H. Abb bbA bb 1 

CAT TCA CCA GCG GCC 
CGA CG 


TT Q 

1 1 _7 


ompC 


riimer b ior cxonxng 


CCT Zl Z\ C CCT CC1\ i^TZ\ 
bb 1 Art.b bb 1 bbA b 1 r\ 

ATC TCG ATT GAT ATC 
CG 


tti n 

1 1 _L u 


ompC 


rrrxiuex u ior cxonxng 


CTC CUC CCT T&CZ CCT 
bib bnb bb 1 1 nb v^ol 

TAT TAA CCC TCT GTT 
A 


TT 1 Q 
1 1 J. 


cL rot 


riiniei. xor cxonxng 


CCC CCC TCC CTC 
bbb bbb 1 bb bib 

TAG AGT GAA CTG ATC 
AAC AAT A 


TT20 


aroC 


Primer B for cloning 


ATG CGC GCG AGA GCT 
CAA CCA GCG TCG CAC 
TTT G 
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TT21 


aroC 


Primer C for cloning 


CTC GAG GCA TGC TGA 
ATA AAA CCG CGA TTG 


TT22 


aroC 


Primer D for cloning 


GCA TGC CCT CGA GGG 
CTCC GTT ATT GTT 
GTG 


MGR169 


CS1 


Binds in CS1 sequence 


TGA TTC CCT TTG TTG 
CGA AGG CGA A 


MGR170 


CS1 


Binds in CS1 sequence 


ATT AAG ATA CCC AAG 
TAA TAC TCA A 


LT-Rl 


LT-AB 


See text 


GCT TTT AAA GGA TCC 
TAG TT | 


LT-03 


LT-AB 


See text 


GGT TAT CTT TCC GGA 
TTG TC 


EST01 


ST 


See text 


CAT GTT CCG GAG GTA 
ATA TGA A 


EST02 


ST 


See text 


AGT TCC CTT TAT ATT 
ATT AAT A 


CSA01 


CS1 


See text 


TGG AGT TTA TAT GAA 
ACT AA 


CSA02 


CS1 


See text 


TGA CTT AGT CAG GAT 
AAT TG 


CS3-01 


CS3 


See text 


ATA CTT ATT AAT AGG 
TCT TT 


CS3-02 


CS3 


See text 


TTG TCG AAG TAA TTG 
TTA TA 



15 
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Table 2 



Target gene 


Sites used for 
cloning into 
pCVD4 4 2 


Sites introduced 
for screening 
purposes 




Site 1 


Site 2 


Site 3 


Site 4 


aroC 


Xbal 


SacI 


Xhol 


SphI 


htrA 


Sail 


SphI 


Xhol 


Xbal 


ompC 


Sail 


SacI 


BlpI 


Xhol 


ompF 


SacI 


SphI 


BlpI 


Xhol 


ompR 


Sail 


SacI 


BlpI 


SphI 



10 



15 



20 



25 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: PEPTIDE THERAPEUTICS LIMITED 

(B) STREET: 321 Cambridge Science Park. Milton Road 

(C) CITY: Cambridge 

(D) STATE: Cambridgeshire 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : CB4 4WG 

(ii) TITLE OF INVENTION: ATTENUATED BACTERIA USEFUL IN VACCINES 
(iii) NUMBER OF SEQUENCES: 6 

• (iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0. Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 
APPLICATION NUMBER: 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: aroC of E.coli 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 492.. 1562 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GTCGACGCGG TGGATATCTC TCCAGACGCG CTGGCGGTTG CTGAACAGAA CATCGAAGAA 
CACGGTCTGA TCCACAACGT CATTCCGATT CGTTCCGATC TGTTCCGCGA CTTGCCGAAA 
GTGCAGTACG ACCTGATTGT CACTAACCCG CCGTATGTCG ATGCGAAGAT ATGTCCGACC 
TGCCAAACAA TACCGCCACG AGCCGGAACT GGGCCTGGCA TCTGGCACTG ACGGCCTGAA 
ACTGACGCGT CGCATTCTCG GTAACGCGGC AGATTACCTT GCTGATGATG GCGTGTTGAT 
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TTGTGAAGTC GGCAACAGCA TGGTACATCT TATGGAACAA TATCCGGATG TTCCGTTCAC 360 

CTGGCTGGAG TTTGATAACG GCGGCGATGG TGTGTTTATG CTCACCAAAG AGCAGCTTAT 420 

5 TGCCGCACGA GAACATTTCG CGATTTATAA AGATTAAGTA AACACGCAAA CACAACAATA 480 

ACGGAGCCGT G ATG GCT GGA AAC ACA ATT GGA CAA CTC TTT CGC GTA ACC 530 
Met Ala Gly Asn Thr He Gly Gin Leu Phe Arg Val Thr 
15 10 

10 

ACC TTC GGC GAA TCG CAC GGG CTG GCG CTC GGC TGC ATC GTC GAT GGT 578 
Thr Phe Gly Glu Ser His Gly Leu Ala Leu Gly Cys He Val Asp Gly 
15 20 25 

15 GTT CCG CCA GGC ATT CCG CTG ACG GAA GCG GAC CTG CAA CAT GAC CTC 626 
Val Pro Pro Gly He Pro Leu Thr Glu Ala Asp Leu Gin His Asp Leu 
30 35 40 45 

GAC CGT CGT CGC CCT GGG ACA TCG CGC TAT ACC ACC CAG CGC CGC GAG 674 
20 Asp Arg Arg Arg Pro Gly Thr Ser Arg Tyr Thr Thr Gin Arg Arg Glu 

50 55 60 

CCG GAT CAG GTC AAA ATT CTC TCC GGT GTT TTT GAA GGC GTT ACT ACC 722 
Pro Asp Gin Val Lys He Leu Ser Gly Val Phe Glu Gly Val Thr Thr 
25 65 70 75 

GGC ACC AGC ATT GGC TTG TTG ATC GAA AAC ACT GAC CAG CGC TCT CAG 770 

Gly Thr Ser He Gly Leu Leu lie Glu Asn Thr Asp Gin Arg Ser Gin 
80 85 90 

30 

GAT TAC AGT GCG ATT AAG GAC GTT TTC CGT CCA GGC CAT GCC GAT TAC 818 

Asp Tyr Ser Ala lie Lys Asp Val Phe Arg Pro Gly His Ala Asp Tyr 
95 100 105 

35 ACC TAC GAA CAA AAA TAC GGT CTG CGC GAT TAT CGC GGC GGT GGA CGT 866 
Thr Tyr Glu Gin Lys Tyr Gly Leu Arg Asp Tyr Arg Gly Gly Gly Arg 
110 115 120 125 

TCT TCC GCC CGC GAA ACC GCC ATG CGC GTG GCG GCA GGA GCT ATT GCC 914 
40 Ser Ser Ala Arg Glu Thr Ala Met Arg Val Ala Ala Gly Ala He Ala 

130 135 140 

AAA AAA TAT CTC GCC GAG AAA TTT GGT ATT GAA ATC CGT GGC TGC CTG 962 
Lys Lys Tyr Leu Ala Glu Lys Phe Gly He Glu He Arg Gly Cys Leu 
4 5 145 150 155 

ACC CAG ATG GGC GAC ATT CCG CTG GAT ATC AAA GAC TGG TCG CAG GTC 1010 
Thr Gin Met Gly Asp He Pro Leu Asp He Lys Asp Trp Ser Gin Val 
160 165 170 

50 

GAG CAA AAT CCG TTT TTT TGC CCG GAC CCC GAC AAA ATC GAC GCG TTA 1058 
Glu Gin Asn Pro Phe Phe Cys Pro Asp Pro Asp Lys He Asp Ala Leu 
175 180 185 

55 GAC GAG TTG ATG CGT GCG CTG AAA AAA GAG GGC GAC TCC ATC GGC GCT 1106 
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Asp Glu Leu Met Arg Ala Leu Lys Lys Glu Gly Asp Ser He Gly Ala 
190 195 200 205 

AAA GTC ACC GTT GTT GCC AGT GGC GTT CCT GCC GGA CTT GGC GAG CCG 
Lys Val Thr Val Val Ala Ser Gly Val Pro Ala Gly Leu Gly Glu Pro 
210 215 220 

GTC TTT GAC CGC CTG GAT GCT GAC ATC GCC CAT GCG CTG ATG AGC ATC 
Val Phe Asp Arg Leu Asp Ala Asp lie Ala His Ala Leu Met Ser He 
225 230 235 

AAC GCG GTG AAA GGC GTG GAA ATT GGC GAC GGC TTT GAC GTG GTG GCG 
Asn Ala Val Lys Gly Val Glu He Gly Asp Gly Phe Asp Val Val Ala 
240 245 250 

CTG CGC GGC AGC CAG AAC CGC GAT GAA ATC ACC AAA GAC GGT TTC CAG 
Leu Arg Gly Ser Gin Asn Arg Asp Glu He Thr Lys Asp Gly Phe Gin 
255 260 265 

AGC AAC CAT GCG GGC GGC ATT CTC GGC GGT ATC AGC AGC GGG CAG CAA 
Ser Asn His Ala Gly Gly He Leu Gly Gly lie Ser Ser Gly Gin Gin 
270 275 280 285 

ATC ATT GCC CAT ATG GCG CTG AAA CCG ACC TCC AGC ATT ACC GTG CCG 
He He Ala His Met Ala Leu Lys Pro Thr Ser Ser He Thr Val Pro 
290 295 300 



GGT CGT ACC ATT AAC CGC TTT GGC GAA GAA GTT GAG ATG ATC ACC AAA 
Gly Arg Thr He Asn Arg Phe Gly Glu Glu Val Glu Met He Thr Lys 
305 310 315 

GGC CGT CAC GAT CCC TGT GTC GGG ATC CGC GCA GTG CCG ATC GCA GAA 
Gly Arg His Asp Pro Cys Val Gly He Arg Ala Val Pro He Ala Glu 
320 325 330 

GCG AAT GCT GGC GAT CGT TTT AAT GGA TCA CCT GTT ACG GCA ACG GGC 
Ala Asn Ala Gly Asp Arg Phe Asn Gly Ser Pro Val Thr Ala Thr Gly 
335 340 345 

GCA AAA TGC CGA TGT GAA GAC TGA TATTCCACGC TGGTAAAAAA TGAATAAAAC 
Ala Lys Cys Arg Cys Glu Asp * 
350 355 

CGCGATTGCG CTGCTGGCTC TGCTTGCCAG TAGCGCCAGC CTGGCAGCGA CGCCGTGGCA 
AAAAATAACC CAACCTGTGC CGGGTAGCGC CAAATCGA 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 amino acids 

(B) TYPE: amino acid 
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(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

5 

Met Ala Gly Asn Thr He Gly Gin Leu Phe Arg Val Thr Thr Phe Gly 
15 10 15 

Glu Ser His Gly Leu Ala Leu Gly Cys He Val Asp Gly Val Pro Pro 
10 20 25' 30 

Gly He Pro Leu Thr Glu Ala Asp Leu Gin His Asp Leu Asp Arg Arg 
35 40 45 

15 Arg Pro Gly Thr Ser Arg Tyr Thr Thr Gin Arg Arg Glu Pro Asp Gin 
50 55 60 

Val Lys He Leu Ser Gly Val Phe Glu Gly Val Thr Thr Gly Thr Ser 
65 70 75 80 

He Gly Leu Leu He Glu Asn Thr Asp -Gin Arg Ser Gin Asp Tyr Ser 
85 90 95 



20 



Ala He Lys Asp Val Phe Arg Pro Gly His Ala Asp Tyr Thr Tyr Glu 
25 100 105 110 

Gin Lys Tyr Gly Leu Arg Asp Tyr Arg Gly Gly Gly Arg Ser Ser Ala 
115 120 125 

30 Arg Glu Thr Ala Met Arg Val Ala Ala Gly Ala He Ala Lys Lys Tyr 
130 135 140 

Leu Ala Glu Lys Phe Gly He Glu He Arg Gly Cys Leu Thr Gin Met 
145 150 155 160 



35 



50 



Gly Asp lie Pro Leu Asp He Lys Asp Trp Ser Gin Val Glu Gin Asn 
165 170 175 



Pro Phe Phe Cys Pro Asp Pro Asp Lys He Asp Ala Leu Asp Glu Leu 
4 0 180 185 190 

Met Arg Ala Leu Lys Lys Glu Gly Asp Ser He Gly Ala Lys Val Thr 
195 200 205 

4 5 Val Val Ala Ser Gly Val Pro Ala Gly Leu Gly Glu Pro Val Phe Asp 
210 215 220 

Arg Leu Asp Ala Asp He Ala His Ala Leu Met Ser He Asn Ala Val 
225 230 235 240 



Lys Gly Val Glu He Gly Asp Gly Phe Asp Val Val Ala Leu Arg Gly 
245 250 255 



Ser Gin Asn Arg Asp Glu He Thr Lys Asp Gly Phe Gin Ser Asn His 
55 260 265 270 
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10 



55 



Ala Gly Gly He Leu Gly Gly He Ser Ser Gly Gin Gin He He Ala 
275 280 285 

His Met Ala Leu Lys Pro Thr Ser Ser He Thr Val Pro Gly Arg Thr 
290 295 300 

He Asn Arg Phe Gly Glu Glu Val Glu Met He Thr Lys Gly Arg His 
305 310 315 320 



Asp Pro Cys Val Gly He Arg Ala Val Pro He Ala Glu Ala Asn Ala 
325 330 335 



Gly Asp Arg Phe Asn Gly Ser Pro Val Thr Ala Thr Gly Ala Lys Cys 
15 340 345 350 

Arg Cys Glu Asp * 
355 

2 0 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 base pairs 

(B) TYPE: nucleic acid 

2 5 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: ompC of E.coli 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
35 (B) LOCATION: 491.. 1594 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

4 0 GTTAACAAGC GTTATAGTTT TTCTGTGGTA GCACAGAATA ATGAAAAGTG TGTAAAGAAG 60 

GGTAAAAAAA ACCGAATGCG AGGCATCCGG TTGAAATAGG GGTAAACAGA CATTCAGAAA 120 

TGAATGACGG TAATAAATAA AGTTAATGAT GATAGCGGGA GTTATTCTAG TTGCGAGTGA 180 

45 

AGGTTTTGTT TTGACATTCA GTGCTGTCAA ATACTTAAGA ATAAGTTATT GATTTTAACC 240 

TTGAATTATT ATTGCTTGAT GTTAGGTGCT TATTTCGCCA TTCCGCAATA ATCTTAAAAA 300 

50 GTTCCCTTGC ATTTACATTT TGAAACATCT ATAGCGATAA ATGAAACATC TTAAAAGTTT 360 

TAGTATCATA TTCGTGTTGG ATTATTCTGC ATTTTTGGGG AGAATGGACT TGCCGACTGA 420 

TTAATGAGGG TTAATCAGTA TGCAGTGGCA TAAAAAAGCA AATAAAGGCA TATAACAGAG 480 
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GGTTAATAAC ATG AAA GTT AAA GTA CTG TCC CTC CTG GTC CCA GCT CTG 
Met Lys Val Lys Val Leu Ser Leu Leu Val Pro Ala Leu 
360 365 370 

CTG GTA GCA GGC GCA GCA AAC GCT GCT GAA GTT TAC AAC AAA GAC GGC 
Leu Val Ala Gly Ala Ala Asn Ala Ala Glu Val Tyr Asn Lys Asp Gly 
375 380 385 

AAC AAA TTA GAT CTG TAC GGT AAA GTA GAC GGC CTG CAC TAT TTC TCT 
Asn Lys Leu Asp Leu Tyr Gly Lys Val Asp Gly Leu His Tyr Phe Ser 
390 395 400 

GAC AAC AAA GAT GTA GAT GGC GAC CAG ACC TAC ATG CGT CTT GGC TTC 
Asp Asn Lys Asp Val Asp Gly Asp Gin Thr Tyr Met Arg Leu Gly Phe 
405 410 415 

AAA GGT GAA ACT CAG GTT ACT GAC CAG CTG ACC GGT TAC GGC CAG TGG 
Lys Gly Glu Thr Gin Val Thr Asp Gin Leu Thr Gly Tyr Gly Gin Trp 
420 425 430 

GAA TAT CAG ATC CAG GGC AAC AGC GCT GAA AAC GAA AAC AAC TCC TGG 
Glu Tyr Gin He Gin Gly Asn Ser Ala Glu Asn Glu Asn Asn Ser Trp 
435 440 445 450 

ACC CGT GTG GCA TTC GCA GGT CTG AAA TTC CAG GAT GTG GGT TCT TTC 
Thr Arg Val Ala Phe Ala Gly Leu Lys Phe Gin Asp Val Gly Ser Phe 
455 460 465 

GAC TAC GGT CGT AAC TAC GGC GTT GTT TAT GAC GTA ACT TCC TGG ACC 
Asp Tyr Gly Arg Asn Tyr Gly Val Val Tyr Asp Val Thr Ser Trp Thr 
470 475 480 

GAC GTA CTG CCA GAA TTC GGT GGT GAC ACC TAC GGT TCT GAC AAC TTC 
Asp Val Leu Pro Glu Phe Gly Gly Asp Thr Tyr Gly Ser Asp Asn Phe 
485 490 495 

ATG CAG CAG CGT GGT AAC GGC TTC GCG ACC TAC CGT AAC ACT GAC TTC 
Met Gin Gin Arg Gly Asn Gly Phe Ala Thr Tyr Arg Asn Thr Asp Phe 
500 505 510 

TTC GGT CTG GTT GAC GGC CTG AAC TTT GCT GTT CAG TAC CAG GGT AAA 
Phe Gly Leu Val Asp Gly Leu Asn Phe Ala Val Gin Tyr Gin Gly Lys 
515 520 525 530 



AAC GGC AAC CCA TCT GGT GAA GGC TTT ACT AGT GGC GTA ACT AAC AAC 
Asn Gly Asn Pro Ser Gly Glu Gly Phe Thr Ser Gly Val Thr Asn Asn 
535 540 545 

GGT CGT GAC GCA CTG CGT CAA AAC GGC GAC GGC GTC GGC GGT TCT ATC 
Gly Arg Asp Ala Leu Arg Gin Asn Gly Asp Gly Val Gly Gly Ser He 
550 555 560 



ACT TAT GAT TAC GAA GGT TTC GGT ATC GGT GGT GCG ATC TCC AGC TCC 
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Thr Tyr Asp Tyr Glu Gly Phe Gly He Gly Gly Ala lie Ser Ser Ser 

565 570 575 

AAA CGT ACT GAT GCT CAG AAC ACC GCT GCT TAC ATC GGT AAC GGC GAC 

Lys Arg Thr Asp Ala Gin Asn Thr Ala Ala Tyr He Gly Asn Gly Asp 

580 585 590 

CGT GCT GAA ACC TAC ACT GGT GGT CTG AAA TAC GAC GCT AAC AAC ATC 

Arg Ala Glu Thr Tyr Thr Gly Gly Leu Lys Tyr Asp Ala Asn Asn He 

595 600 605 610 

TAC CTG GCT GCT CAG TAC ACC CAG ACC TAC AAC GCA ACT CGC GTA GGT 

Tyr Leu Ala Ala Gin Tyr Thr Gin Thr Tyr Asn Ala Thr Arg Val Gly 

615 620 625 

TCC CTG GGT TGG GCG AAC AAA GCA CAG AAC TTC GAA GCT GTT GCT CAG 

Ser Leu Gly Trp Ala Asn Lys Ala Gin Asn Phe Glu Ala Val Ala Gin 

630 635 640 

TAC CAG TTC GAC TTC GGT CTG CGT CCG TCC CTG GCT TAC CTG CAG TCT 

Tyr Gin Phe Asp Phe Gly Leu Arg Pro Ser Leu Ala Tyr Leu Gin Ser 

645 650 655 

AAA GGT AAA AAC CTG GGT CGT GGC TAC GAC GAC GAA GAT ATC CTG AAA 

Lys Gly Lys Asn Leu Gly Arg Gly Tyr Asp Asp Glu Asp lie Leu Lys 

660 665 670 

TAT GTT GAT GTT GGT GCT ACC TAC TAC TTC AAC AAA AAC ATG TCC ACC 

Tyr Val Asp Val Gly Ala Thr Tyr Tyr Phe Asn Lys Asn Met Ser Thr 

675 680 685 690 

TAC GTT GAC TAC AAA ATC AAC CTG CTG GAC GAC AAC CAG TTC ACT CGT 

Tyr Val Asp Tyr Lys He Asn Leu Leu Asp Asp Asn Gin Phe Thr Arg 

695 700 705 



GAC GCT GGC ATC AAC ACT GAT AAC ATC GTA GCT CTG GGT CTG GTT TAC 
Asp Ala Gly lie Asn Thr Asp Asn He Val Ala Leu Gly Leu Val Tyr 
710 715 720 

CAG TTC TAA TCTCGATTGA TATCGAACAA GGGCCTGCGG GCCCI 1 1 1 I I 
Gin Phe * 
725 

CATTGTTTTC AGCGTACAAA CTCAGTTTTT TGGTGTACTC TTGCGACCGT TCGCATGAGG 
ATAATCACGT ACGGAAATA 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 367 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

5 

Met Lys Val Lys Val Leu Ser Leu Leu Val Pro Ala Leu Leu Val Ala 
15 10 15 

Gly Ala Ala Asn Ala Ala Glu Val Tyr Asn Lys Asp Gly Asn Lys Leu 

10 20 25' 30 

Asp Leu Tyr Gly Lys Val Asp Gly Leu His Tyr Phe Ser Asp Asn Lys 
35 40 45 

15 Asp Val Asp Gly Asp Gin Thr Tyr Met Arg Leu Gly Phe Lys Gly Glu 
50 55 60 

Thr Gin Val Thr Asp Gin Leu Thr Gly Tyr Gly Gin Trp Glu Tyr Gin 
65 70 75 80 

20 

He Gin Gly Asn Ser Ala Glu Asn Glu Asn Asn Ser Trp Thr Arg Val 
85 90 95 

Ala Phe Ala Gly Leu Lys Phe Gin Asp Val Gly Ser Phe Asp Tyr Gly 

2 5 100 105 110 

Arg Asn Tyr Gly Val Val Tyr Asp Val Thr Ser Trp Thr Asp Val Leu 
115 120 125 

3 0 Pro Glu Phe Gly Gly Asp Thr Tyr Gly Ser Asp Asn Phe Met Gin Gin 

130 135 140 

Arg Gly Asn Gly Phe Ala Thr Tyr Arg Asn Thr Asp Phe Phe Gly Leu 
145 150 155 160 

35 

Val Asp Gly Leu Asn Phe Ala Val Gin Tyr Gin Gly Lys Asn Gly Asn 
165 170 175 

Pro Ser Gly Glu Gly Phe Thr Ser Gly Val Thr Asn Asn Gly Arg Asp 

4 0 180 185 190 

Ala Leu Arg Gin Asn Gly Asp Gly Val Gly Gly Ser He Thr Tyr Asp 
195 200 205 

4 5 Tyr Glu Gly Phe Gly He Gly Gly Ala He Ser Ser Ser Lys Arg Thr 
210 215 220 

Asp Ala Gin Asn Thr Ala Ala Tyr He Gly Asn Gly Asp Arg Ala Glu 
225 230 235 240 

50 

Thr Tyr Thr Gly Gly Leu Lys Tyr Asp Ala Asn Asn He Tyr Leu Ala 
245 250 255 

Ala Gin Tyr Thr Gin Thr Tyr Asn Ala Thr Arg Val Gly Ser Leu Gly 
55 260 265 t 270 
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Trp Ala Asn Lys Ala Gin Asn Phe Glu Ala Val Ala Gin Tyr Gin Phe 
275 280 285 




5 


Asp Phe Gly Leu Arg Pro Ser Leu Ala Tyr Leu Gin Ser Lys Gly Lys 
290 295 300 

Asn Leu Gly Arg Gly Tyr Asp Asp Glu Asp He Leu Lys Tyr Val Asp 
305 310 315 320 




10 


Val Gly Ala Thr Tyr Tyr Phe Asn Lys Asn Met Ser Thr Tyr Val Asp 
325 330 335 




15 


Tyr Lys He Asn Leu Leu Asp Asp Asn Gin Phe Thr Arg Asp Ala Gly 
340 345 350 

lie Asn Thr Asp Asn He Val Ala Leu Gly Leu Val Tyr Gin Phe * 
355 360 365 




20 


(2) INFORMATION FOR SEQ ID NO: 5: 




25 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




30 


(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: ompF of E.coli 




35 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 457. .1545 




40 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AAAACTAATC CGCATTCTTA TTGCGGATTA Gl 1 1 1 1 ICI 1 AGCIAAIAGC ACAAI 1 1 ICA 


60 




TACTAI 1 1 1 1 IGGCAI ICIG GAIGICIGAA AGAAGAI 1 1 1 GTGCCAGGTC GATAAAGTTT 


120 


45 


CCATCAGAAA CAAAATTTCC GTTTAGTTAA TTTAAATATA AGGAAATCAT ATAAATAGAT 


180 




TAAAATTGCT GTAAATATCA TCACGTCTCT ATGGAAATAT GACGGTGTTC ACAAAGTTCC 


240 


50 


TTAAAI IMA CI 1 1 IGGI IA CAIAI 1 1 1 1 1 CTTTTTGAAA CCAAATCTTT ATCTTTGTAG 
CACTTTCACG GTAGCGAAAC GTTAGTTTGA ATGGAAAGAT GCCTGCAGAC ACATAAAGAC 


300 
360 




ACCAAACTCT CATCAATAGT TCCGTAAATT TTTATTGACA GAACTTATTG ACGGCAGTGG 


420 


55 


CAGGTGTCAT AAAAAAAACC ATGAGGGTAA TAAATA ATG ATG AAG CGC AAT ATT 


474 
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Met Met Lys Arg Asn lie 
1 5 



CTG GCA GTG ATC GTC CCT GCT CTG TTA GTA GCA GGT ACT GCA AAC GCT 
Leu Ala Val He Val Pro Ala Leu Leu Val Ala Gly Thr Ala Asn Ala 
10 15 20 

GCA GAA ATC TAT AAC AAA GAT GGC AAC AAA GTA GAT CTG TAC GGT AAA 
Ala Glu He Tyr Asn Lys Asp Gly Asn Lys Val Asp Leu Tyr Gly Lys 
25 30 ' 35 

GCT GTT GGT CTG CAT TAT TTT TCC AAG GGT AAC GGT GAA AAC AGT TAC 
Ala Val Gly Leu His Tyr Phe Ser Lys Gly Asn Gly Glu Asn Ser Tyr 
40 45 50 

GGT GGC AAT GGC GAC ATG ACC TAT GCC CGT CTT GGT TTT AAA GGG GAA 
Gly Gly Asn Gly Asp Met Thr Tyr Ala Arg Leu Gly Phe Lys Gly Glu 
55 60 65 70 

ACT CAA ATC AAT TCC GAT CTG ACC GGT TAT GGT CAG TGG GAA TAT AAC 
Thr Gin lie Asn Ser Asp Leu Thr Gly Tyr Gly Gin Trp Glu Tyr Asn 
75 .80 85 

TTC CAG GGT AAC AAC TCT GAA GGC GCT GAC GCT CAA ACT GGT AAC AAA 
Phe Gin Gly Asn Asn Ser Glu Gly Ala Asp Ala Gin Thr Gly Asn Lys 
90 95 100 

ACG CGT CTG GCA TTC GCG GGT CTT AAA TAC GCT GAC GTT GGT TCT TTC 
Thr Arg Leu Ala Phe Ala Gly Leu Lys Tyr Ala Asp Val Gly Ser Phe 
105 110 115 

GAT TAC GGC CGT AAC TAC GGT GTG GTT TAT GAT GCA CTG GGT TAC ACC 
Asp Tyr Gly Arg Asn Tyr Gly Val Val Tyr Asp Ala Leu Gly Tyr Thr 
120 125 130 

GAT ATG CTG CCA GAA TTT GGT GGT GAT ACT GCA TAC AGC GAT GAC TTC 
Asp Met Leu Pro Glu Phe Gly Gly Asp Thr Ala Tyr Ser Asp Asp Phe 
135 140 145 150 

TTC GTT GGT CGT GTT GGC GGC GTT GCT ACC TAT CGT AAC TCC AAC TTC 
Phe Val Gly Arg Val Gly Gly Val Ala Thr Tyr Arg Asn Ser Asn Phe 
155 160 165 

TTT GGT CTG GTT GAT GGC CTG AAC TTC GCT GTT CAG TAC CTG GGT AAA 
Phe Gly Leu Val Asp Gly Leu Asn Phe Ala Val Gin Tyr Leu Gly Lys 
170 175 180 

AAC GAG CGT GAC ACT GCA CGC CGT TCT AAC GGC GAC GGT GTT GGC GGT 
Asn Glu Arg Asp Thr Ala Arg Arg Ser Asn Gly Asp Gly Val Gly Gly 
185 190 195 

TCT ATC AGC TAC GAA TAC GAA GGC TTT GGT ATC GTT GGT GCT TAT GGT 
Ser He Ser Tyr Glu Tyr Glu Gly Phe Gly He Val Gly Ala Tyr Gly 
200 205 210 
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GCA GCT GAC CGT ACC AAC CTG CAA GAA GCT CAA CCT CTT GGC AAC GGT 
Ala Ala Asp Arg Thr Asn Leu Gin Glu Ala Gin Pro Leu Gly Asn Gly 
215 220 225 230 

AAA AAA GCT GAA CAG TGG GCT ACT GGT CTG MG TAC GAC GCG AAC AAC 
Lys Lys Ala Glu Gin Trp Ala Thr Gly Leu Lys Tyr Asp Ala Asn Asn 
235 240 245 

ATC TAC CTG GCA GCG AAC TAC GGT GAA ACC CGT AAC GCT ACG CCG ATC 
He Tyr Leu Ala Ala Asn Tyr Gly Glu Thr Arg Asn Ala Thr Pro He 
250 255 260 

ACT AAT AAA TTT ACA AAC ACC AGC GGC TTC GCC AAC AAA ACG CAA GAC 
Thr Asn Lys Phe Thr Asn Thr Ser Gly Phe Ala Asn Lys Thr Gin Asp 
265 270 275 

GTT CTG TTA GTT GCG CAA TAC CAG TTC GAT TTC GGT CTG CGT CCG TCC 
Val Leu Leu Val Ala Gin Tyr Gin Phe Asp Phe Gly Leu Arg Pro Ser 
280 285 290 

ATC GCT TAC ACC AAA TCT AAA GCG AAA GAC GTA GAA GGT ATC GGT GAT 
He Ala Tyr Thr Lys Ser Lys Ala Lys Asp Val Glu Gly He Gly Asp 
295 300 305 310 

GTT GAT CTG GTG AAC TAC TTT GAA GTG GGC GCA ACC TAC TAC TTC AAC 
Val Asp Leu Val Asn Tyr Phe Glu Val Gly Ala Thr Tyr Tyr Phe Asn 
315 320 325 

AAA AAC ATG TCC ACC TAT GTT GAC TAC ATC ATC AAC CAG ATC GAT TCT 
Lys Asn Met Ser Thr Tyr Val Asp Tyr He He Asn Gin He Asp Ser 
330 335 340 

GAC AAC AAA CTG GGC GTA GGT TCA GAC GAC ACC GTT GCT GTG GGT ATC 
Asp Asn Lys Leu Gly Val Gly Ser Asp Asp Thr Val Ala Val Gly He 
345 350 355 

GTT TAC CAG TTC TAA TAGCACACCT CTTTGTTAAA TGCCGAAAAA ACAGGACTTT 
Val Tyr Gin Phe * 
360 

GGTCCTGTTT TTTTTATACC TTCCAGAGCA ATCTCACGTC TTGCAAAAAC AGCCTGCGTT 
TTCATCAGTA ATAGTTGGAA TTTTGTAAAT CTCCCGTTAC CCTGATAGCG GACTTCCCTT 
CTGTAACCAT AATGGAACCT CGTCATGTTT GAGAACATTA CCGCCGCTCC TGCCGACCCG 
ATTCTGGGCC TGGCCGATCT GTTTCGTGCC GATGAACGTC CCG 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Met Lys Arg Asn He Leu Ala Val He Val Pro Ala Leu Leu Val 
15 10 15 

Ala Gly Thr Ala Asn Ala Ala Glu He Tyr Asn Lys Asp Gly Asn Lys 
20 25 30 

Val Asp Leu Tyr Gly Lys Ala Val Gly Leu His Tyr Phe Ser Lys Gly 
35 40 45 

Asn Gly Glu Asn Ser Tyr Gly Gly Asn Gly Asp Met Thr Tyr Ala Arg 
50 55 60 

Leu Gly Phe Lys Gly Glu Thr Gin He Asn Ser Asp Leu Thr Gly Tyr 
65 70 75 80 

Gly Gin Trp Glu Tyr Asn Phe Gin Gly Asn Asn Ser Glu Gly Ala Asp 
85 90 95 

Ala Gin Thr Gly Asn Lys Thr Arg Leu Ala Phe Ala Gly Leu Lys Tyr 
100 105 110 

Ala Asp Val Gly Ser Phe Asp Tyr Gly Arg Asn Tyr Gly Val Val Tyr 
115 120 125 

Asp Ala Leu Gly Tyr Thr Asp Met Leu Pro Glu Phe Gly Gly Asp Thr 
130 135 140 

Ala Tyr Ser Asp Asp Phe Phe Val Gly Arg Val Gly Gly Val Ala Thr 
145 150 155 160 

Tyr Arg Asn Ser Asn Phe Phe Gly Leu Val Asp Gly Leu Asn Phe Ala 
165 170 175 

Val Gin Tyr Leu Gly Lys Asn Glu Arg Asp Thr Ala Arg Arg Ser Asn 
180 185 190 

Gly Asp Gly Val Gly Gly Ser He Ser Tyr Glu Tyr Glu Gly Phe Gly 
195 200 205 

He Val Gly Ala Tyr Gly Ala Ala Asp Arg Thr Asn Leu Gin Glu Ala 
210 215 220 

Gin Pro Leu Gly Asn Gly Lys Lys Ala Glu Gin Trp Ala Thr Gly Leu 
225 230 235 240 

Lys Tyr Asp Ala Asn Asn lie Tyr Leu Ala Ala Asn Tyr Gly Glu Thr 
245 250 255 

Arg Asn Ala Thr Pro He Thr Asn Lys Phe Thr Asn Thr Ser Gly Phe 
260 265 270 
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Ala Asn Lys Thr Gin Asp Val Leu Leu Val Ala Gin Tyr Gin Phe Asp 
275 280 285 

Phe Gly Leu Arg Pro Ser He Ala Tyr Thr Lys Ser Lys Ala Lys Asp 
290 295 300 

Val Glu Gly He Gly Asp Val Asp Leu Val Asn Tyr Phe Glu Val Gly 
305 310 315 320 

Ala Thr Tyr Tyr Phe Asn Lys Asn Met Ser Thr Tyr Val Asp Tyr He 
325 330 335 

lie Asn Gin He Asp Ser Asp Asn Lys Leu Gly Val Gly Ser Asp Asp 
340 345 350 

Thr Val Ala Val Gly He Val Tyr Gin Phe * 
355 360 
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CLAIMS 



A bacterium attenuated by a non-reverting mutation 
in each of the aroC gene, the ompF gene and the 
ompC gene . 

A bacterium according to claim 1 which infects by 
the oral route . 

A bacterium according to claim 1 or 2 which is from 
the genera Escherichia , Salmonella , Vibrio, 
Haemophilus , Neisseria , Yersinia , Bordetella or 
Brucella . 

A bacterium according to claim 3 which is a strain 
of Escherichia coli, Salmonella typhimurium, 
Salmonella typhi, Salmonella enteritidis , 
Salmonella choleraesuis , Salmonella dublin, 
Haemophilus influenzae, Neisseria gonorrhoeae , 
Yersinia enterocolitica , Bordetella pertussis or 
Brucella abortus . 

A bacterium according to claim 4 which is a strain 
of enterotoxigenic E.coli (ETEC. 

A bacterium according to any one of the preceding 
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claims which is further attenuted by a mutation in 
a fourth gene. 

A bacterium according to claim 6 wherein the fourth 
gene is aroA, aroE, pur, htrA, galE, cya , crp, phoP 
or surA. 

A bacterium according to any one of the preceding 
claims, wherein the mutation in each gene is a 
defined mutation. 

A bacterium according to any one of the preceding 
claims, wherein the mutation in each gene is 
deletion of the entire coding sequence. 

A bacterium according to any one of the preceding 
claims which has been genetically engineered to 
express a heterologous antigen. 

A bacterium according to claim 10, wherein 
expression of the antigen is driven by the nirB 
promoter or the htrA promoter. 

A vaccine comprising a bacterium as defined in any 
one of the preceding claims and a pharmaceutically 
acceptable carrier or diluent. 
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A bacterium as defined in any one of claims 1 to 11 
for use in a method of vaccinating a human or 
animal . 

Use of a bacterium as defined in any one of claims 
1 to 11 for the manufacture of a medicament for 
vaccinating a human or animal. 

A method of raising an immune response in a 
mammalian host, which comprises administering to 
the host a bacterium attenuated by a non-reverting 
mutation in each of the aroC gene, the ompF gene 
and the ompC gene. 
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ABSTRACT 



ATTENUATED BACTERIA USEFUL IN VACCINES 

The invention provides a bacterium attenuated by a non- 
reverting mutation in each of the aroC gene, the ompF 
gene and the ompC gene. The bacterium is useful as a 
vaccine. The bacterium may, for example, be an attenuated 
strain of E.coli useful in vaccination against diarrhoea. 
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Figure 1 



Primer A, includes restriction site 1 

Primer B, includes restriction site 2 
Primer C, includes restriction sites 3 and 4 
Primer D, includes restriction sites 3 and 4 



ETEC chromosome 




-500 bp 



Target gene 



-500 bp 



PCR amplify target gene using 
primer pairs A/C and B/D 



Gel purify PCR products 



Amplify full length deletion cassette using 
equal amounts of PCR products 1 and 2 
as template and primers A and B 



Clone deletion cassette into pCVD442 
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Figure 4 
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Figure 5 
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